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Abstract. In this paper we propose a new approach to estimation of the tail 
exponent in financial stock markets. We begin the study with the finite sample be- 
havior of the Hill estimator under a— stable distributions. Using large Monte Carlo 
simulations we show that the Hill estimator overestimates the true tail exponent 
and can hardly be used on samples with small length. Utilizing our results, we 
introduce a Monte Carlo-based method of estimation for the tail exponent. Our 
proposed method is not sensitive to the choice of tail size and works well also on 
small data samples. The new estimator also gives unbiased results with symmet- 
rical confidence intervals. Finally, we demonstrate the power of our estimator on 
the international world stock market indices. On the two separate periods of 2002- 
2005 and 2006-2009 we estimate the tail exponent. 
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I. Introduction 

Statistical analysis of financial data has been of vigorous interest in recent years, 
mainly among the physics community (Mantegna and Stanley, 2000; Bouchaud and 
Potters, 2001; Buchanan, 2002; Mantegna et al., 1999; Plerou et al, 2000; Stanley 
et al., 2000; Stanley, 2003). One of the main reasons driving the research is the use 
of established statistical characteristics to better describe and understand real-world 
financial data. 

Dynamics of financial markets is the outcome of large number of individual decisions 
based on heterogeneous information. Financial returns representing the interaction 
of the market participants have been assumed to be Normally distributed for a long 
time. The strongest argument supporting this assumption is based on the Cen- 
tral Limit Theorem, which states that the sum of a large number of independent, 
identically distributed variables from a finite-variance distribution will tend to be 
normally distributed. However, financial returns showed to have heavier tails, which 
is a possible source of infinite variance. 

Mandelbrot (1963) and Fama (1965) proposed stable distributions as an alternative 
to the Gaussian distribution model. Stable distributions were introduced by Levy 
(1925), who investigated the behavior of sums of independent random variables. Al- 
though we know other heavy-tailed alternative distributions (such as student's t, 
hyperbolic or normal inverse Gaussian), stable distributions are attractive for re- 
searchers as they are supported by the generalized Central Limit Theorem. The 
theorem states that stable laws are the only possible limit distributions for properly 
normalized and centered sums of independent, identically distributed random vari- 
ables. A sum of two independent random variables having a Levy stable distribution 
with parameter a is again a Levy stable distribution with the same parameter a. 
However, this invariance property does not hold for different values of a. Observed 
stock market prices are argued to be the sum of many small terms, hence a stable 
model should be used to describe them. When a < 2, the variance of the stable 
process is infinite and the tails are asymptotically equivalent to a Pareto law, i.e., 
they exhibit power-law behavior. Stable distributions have been proposed as a model 
for many types of physical and economic systems as they can accommodate fat tails 
and asymmetry and fit the data well. Examples in finance and economics are given 
in Mandelbrot (1963), Fama (1965), Embrechts et al. (1997) or Rachev and Mittnik 
(2000). 

There have, however, been many applications of Levy stable distributions to empiri- 
cal data sets which could raise doubts about the correctness of the tail estimate (Lux, 
1996; Voit, 2005; Podobnik et al., 2000). There is a significant difference between the 
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value of the estimated a (based on the whole data set) and the estimated tail expo- 
nent. The tail exponent is estimated only on an arbitrarily chosen part of the data 
(Hill, 1975; Weron, 2001). Since extreme observations of prices on financial markets 
are of great importance, this problem deserves further research. If a is underesti- 
mated, the occurrence of extreme events is overestimated. Weron (2001) shows that 
the estimated tail exponent is very sensitive to changes in parameters and to the 
size of the data set, hence the estimates can be highly misleading. Simulations show 
that a large data set (10 6 ) is needed for identification of the true tail behavior. The 
logical step is to use high-resolution data analysis. Lux (1996) was one of the first 
to use high-frequency data, doing so for analysis of the German stock market index. 
Several studies concerning estimation of stable distributions followed (Mantegna and 
Stanley, 2000; Dacorogna et al., 2001; Voit, 2005). 

In our paper, we append an analysis of the finite sample properties of the Hill esti- 
mator to the discussions. Moreover, we introduce a Monte Carlo-based tail exponent 
estimation method. In the first part, we briefly discuss the basics of the Hill esti- 
mator as well as stable distributions and their tail behavior. In the second part, we 
provide the finite sample properties of the Hill estimator and discuss the implications 
for its use on real- world data. In the third part, we utilize the results and propose a 
tail exponent estimation method based on Monte Carlo simulations. Finally, we il- 
lustrate the power of our method of estimation on leading world stock market indices 
and conclude. 



2. Estimation of the tail exponent 

The simplest method of estimating the tail exponent a is log-log linear regression. 
This method is very sensitive to the sample size and the choice of the number of 
observations used in the regression. Weron (2001) shows that it also yields biased 
estimates and cannot be used for financial applications. Popular method for esti- 
mating the tail exponent is also rank-frequency plot. Gabaix and Ibragimov (2009) 
proposed an extension of the rank-frequency plot with lower bias for small samples. 
Maximum likelihood method of estimation still belongs to most reliable techniques. 
For example Clauset et al. (2009) combine maximum-likelihood fitting with goodness- 
of-fit tests based on Kolmogorov-Smirnov statistic and likelihood ratios. Recently, 
many authors have studied the return intervals between consecutive price fluctuations 
above some volatility threshold. Podobnik et al. (2009) found a simple relationship 
between the mean return intervals f q and the threshold q, f q oc q a , where a is power- 
law exponent. As authors show, this approach can also be used for estimation of the 
power-law exponent for financial data. 
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Another widely used tail exponent estimator, equivalent to maximum-likelihood fit- 
ting, based on order statistics is proposed in Hill (1975). Pickands (1975) and Dekkers 
et al. (1989) show other variations of the Hill estimator. Mittnik et al. (1998) provide 
a modification of the Pickands estimator using high-order approximation. A quantile 
method is used by McCulloch (1996). For more detailed discussion on tail estimators 
see Embrechts et al. (1997) and Resnick (2007). 

The Hill estimator tends to overestimate the tail exponent of a stable distribution if 
the value of a is close to two and the sample size is not very large. For a more detailed 
discussion see Weron (2001) and Embrechts et al. (1997). Several researchers have 
used misleading estimators of the tail exponent a to conclude that various data sets 
had a > 2, i.e., the data is not stable. First of all, let us briefly introduce the Hill 
estimator. 

2.1. Hill estimator. The most popular method for estimating the tail exponent 
a is the Hill estimator (Hill, 1975). The Hill estimator is used to estimate the 
tail exponent only, therefore it does not assume a parametric form for the entire 
distribution function. 

Let's suppose Xi,X 2 , . . . ,X n is a sequence of i.i.d. random variables with the dis- 
tribution function F(x). Furthermore, let's assume that 1 — F(x) has a regularly 
varying upper tail with coefficient —a. 

(1) P(X >x) = l-F(x) = x~ a L(x), x>0 

where the function L(x) is slowly varying at infinity 1 (for more details see Resnick 
(2007)). 

Equation 1 indicates that the right tail of the distribution function F(x) has the 
same asymptotic properties as the tail of the Pareto distribution, see (Wagner and 
Marsh, 2004). 

Let us define the order statistics Xm > X( 2 ) > . . . > Xi n ). The Hill estimator of the 
tail exponent a is defined as: 

(l k 

(2) a H = I - ^2 lo S x (n-i+i) ~ logX (n _ fc) 

V i=i 

where k is a truncation (or smoothing) parameter which defines a subsample used 
for the estimation, k < n. 

1 If eq. 1 is satisfied, the distribution function F(x) belongs to the maximum domain of attraction 
of (j> a , F € MDA((j) a ), where <f> a (x) = e~ x , x > 0, a > 0. For a more detailed treatment of the 
extreme value theory see Embrechts et al. (1997) 
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Many authors have studied the asymptotical properties of the Hill estimator. Ac- 
cording to Mason (1982) the Hill estimator is weakly consistent if 

k 

(3) k — > oo, > as n — > oo. 

n 

Furthermore, Goldie and Smith (1987) proved the asymptotic normality of the Hill 
estimator, i.e., 

(4) Vki&Jj 1 -a' 1 ) ~ iV(0,a- 2 ). 

Consequently, an is also approximately normal with mean a and variance a 2 /k. 
Moreover, (Hsing, 1991) shows that the Hill estimator is asymptotically quite robust 
with respect to deviations from independence. For a more detailed discussion of the 
properties of the Hill estimator see Resnick (2007), Embrechts et al. (1997). 

2.2. Choice of optimal k in the Hill estimator. In the classical approach, there 
is considerable difficulty in choosing the right value of the truncation parameter k, 
since it can influence the accuracy of the estimate significantly. The methods of 
choosing k from the empirical data set are very often based on a trade-off between 
the bias and the variance of the Hill estimator, k has to be sufficiently small to ensure 
the observations > X( 2 ) > . . . > X^ still belong to the tail of the distribution. 
On the other hand, if k is too small, the estimator lacks precision. 

There are several methods for choosing the "optimal" value of k. The first possibility 
is to make a Hill plot, where 6ch is plotted against k, and look for a region where the 
graph has fairly stable behavior to identify the optimal value of the order statistics 
k. Alternatively, there are other methods for choosing the optimal k, such as the 
bootstrap approach (Hall, 1990). 

In our simulations later in this paper we show that choosing k from the Hill plot 
is very inaccurate and for higher values of a it is almost impossible to set its right 
value. Resnick (2007) clearly illustrates why this technique very often leads to a "Hill 
horror plot" from which it is not possible to discern the correct value of k. Using the 
Hill estimates from simulated random variables on a standard symmetric Levy-stable 
distribution for different values of k, we also clearly demonstrate that choosing the 
"correct" A; is a very difficult task. Furthermore, for a short dataset (< 10 6 ) it is 
usually impossible to find out the right value of k, especially when a is close to 2. 
Unfortunately, this setting is very often the case when we analyze empirical financial 
market data. 

To overcome the problem of choosing the optimal k we introduce a new estimation 
method based on comparison of the estimate of the empirical dataset with estimates 
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on a pre-simulated random variable from the standard Levy distribution for k on the 
interval [1% — 20%]. The main advantage of our approach is that it does not assume 
the smoothing parameter k to be known. 

3. Finite sample properties of the Hill estimator for different 

heavy tails 

We would like to study the finite sample behavior of the Hill estimator in detail for 
stable distributions with different heavy tails first. Let us start with an introduction 
to stable distributions. 

3.1. Stable distributions. Stable distributions are a class of probability laws with 
appealing theoretical properties. Their application to financial modeling comes from 
the fact that they generalize the Gaussian distribution, which does not describe well- 
known stylized facts about stock market data. Stable distributions allow for heavy 
tails and skewness. In this paper, we provide a basic idea about stable distributions. 
Interested readers can find the theorems and proofs in Nolan (2003), Zolotarev (1986) 
and Samorodnitsky and Taqqu (1994). 

The reason for the term stable is that stable distributions retain their shape up to 
scale and shift under addition: if X, X±, X 2 ,..., X n are independent, identically 
distributed stable random variables, then for every n 

(5) X 1 + X 2 + ... + X n = c n X + d n 

for constants c n > and d n . Equality (=) here means that the right-hand and left- 
hand sides have the same distribution. Normal distributions satisfy this property: 
the sum of normals is normal. In general, the class of all laws satisfying (5) can be 
described by four parameters, (a,/3,j,5). Parameter a is called the characteristic 
exponent and must be in the range a G (0, 2]. The coefficients c n are equal to n l / a . 
Parameter /3 is called the skewness of the law and must be in the range — 1 < /3 < 1. 
If (3 = 0, the distribution is symmetric, if /3 > it is skewed to the right, and if 
(3 < it is skewed to the left. While parameters a and (5 determine the shape of the 
distribution, 7 and 5 are scale and location parameters, respectively. 

Due to a lack of closed form formulas for probability density functions (except for 
three stable distributions: Gaussian, Cauchy, and Levy) the a-stable distribution 
can be described by a characteristic function which is the inverse Fourier transform 
of the probability density function, i.e., <p{u) = E exp(raX) . 

A confusing issue with stable parameters is that there are multiple parametrizations 
used in the literature. Nolan (2003) provides a good guide to all the definitions. In 
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(6) <j>(u) 



this paper, we will use Nolan's parametrization, which is jointly continuous in all 
four parameters. A random variable X is distributed by S(a, f3, 7, 5) if it has the 
following characteristic function: 

exp(-7 Q |u| Q [l + ^(tan^)(signu)(| 7 M| 1 - Q - 1)] + iSu) a ^ 1 
exp(— 7|tt| [1 + i/J~(signu) ln( 7 |w|)] + iSu) a = 1 

There are only three cases where a closed-form expression for density exists and 
we can verify directly if the distribution is stable - the Gaussian, Cauchy, and 
Levy distributions. Gaussian laws are stable with a = 2 and (3 = 0. More pre- 
cisely, iV(0,(x 2 ) = S(2,0,a/V2,0). Cauchy laws are stable with a = 1 and /3 = 0, 
Cauchy(7, 5) = S(l, 0, 7, 5); and finally, Levy laws are stable with a = 1/2 and (3 = 1; 
Levy(7,<5) = 5(1/2, 1,7,7 + Nolan (2003) shows these examples in detail. 

For all values of parameter a < 2 and — 1 < (3 < 1, stable distributions have 
two tails that are asymptotically power laws. The asymptotic tail behavior of non- 
Gaussian stable laws for X ~ S(a, (3, 7, 5) with a < 2 and — 1 < (3 < 1 is defined as 
follows: 

(7) \imx a P(X > x) = c Q (l + /3) 7 a 

x— >oo 

(8) limx a P(X< -x) = c a (1-/3)7°, 
where 

(9) c a = sin (jy^ T (a) /ix. 

If the data is stable, the empirical distribution function should be approximately a 
straight line with slope —a in a log- log plot. 

A negative aspect of non-Gaussian stable distributions (a < 2) is that not all mo- 
ments exist. 2 The first moment EX is not finite (or is undefined) when a < 1. On 
the other hand, when 1 < a < 2 the first moment is defined as 

7TQ! 

(10) EX = fi = 5 - /3 7 tan— . 

Non-Gaussian stable distributions do not have finite second moment. It is also 
important to emphasize that the skewness parameter f3 is different from the classical 
skewness parameter used for the Gaussian distribution. It cannot be defined because 
the second and third moments do not exist for non-Gaussian stable distributions. The 
kurtosis is also undefined, because the fourth moment does not exist either. 

The characteristic exponent a gives important information about financial market 
behavior. When a < 2, extreme events are more probable than for the Gaussian 



2 It is possible to define a fractional absolute moment of order p, where p is any real number. For 
< p < a, E \X\ is finite, but for p>a,E \X\ P = 00 (Nolan, 2003). 
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distribution. From an economic point of view some values of parameter a do not 
make sense. For example, in the interval < a < 1 the random variable X does 
not have a finite mean. In this case, an asset with returns which follow a stable law 
with < a < 1 would have an infinite expected return. Thus, we are looking for 
1 < a < 2 to be able to predict extreme values more precisely than by the Gaussian 
distribution. 

3.2. Research design. The simulations are constructed so as to show how the Hill 
estimator behaves for different heavy tails data. For this purpose we use the Levy sta- 
ble distribution depending on parameters (a, (3, 7, 5) and set them to (a, 0, a/2/2, 0), 
where 1.1 < a < 2 with a step of 0.1. For each parameter a, 1,000 time series with 
lengths from 10 3 to 10 6 are simulated and the tail exponent is estimated using the 
Hill estimator for k < 1% and k e (1%,20%] separately. 

In other words, we simulate the grid of different a for different series lengths. For 
each position in the grid, we estimate the tail exponent using the Hill estimator. 
This allows us to study the finite sample properties of the Hill estimator for different 
length series, different tail exponents and different k. 

3.3. Results from Monte Carlo simulations. The biggest problem with the op- 
timal choice of k is that a is not known, so we cannot really choose the optimal k. 
In our simulation, we show the finite sample properties of the Hill estimator k < 1% 
and k e (1%, 20%] separately. 

3.3.1. Hill estimation for k > 1%. We begin the simulations with a time series length 
of 10 3 , which is a usual sample length for real- world financial data, i.e., it equates to 
approximately 4 years of daily returns of a stock. Figure 1 shows the 95% confidence 
intervals 3 of the Hill estimate for different a for all k. It can be seen that it is very 
difficult to statistically distinguish between different tail exponents a and it is very 
unclear which optimal k we should choose. 

Figure 2 shows much more precise results for a time series of length 10 6 . This allows 
for narrow confidence intervals, but even with this exact result we can see that the 
Hill estimator is problematic as it overestimates the value of the tail exponent and 
we cannot pick the optimal k even from this large grid of simulated data. The only 
way of estimating the tail exponent would be to pick k, estimate the tail exponent, 
and compare it to the grid of simulated confidence intervals. But we would still need 
at least a 10 6 sample size to achieve an exact result. As we can see in Figure 1, we 
cannot get a statistically significant estimate on data with length 10 3 . Table 1 shows 



3 A11 confidence intervals are computed from the asymptotic normality of the sample. 
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the optimal value of k, where the Hill estimator gives a good estimate of the tail 
exponent a 4 . 



n 












a 










1.1 


1.2 


1.3 


1.4 


1.5 


1.6 


1.7 


1.8 


1.9 


2 


10 3 


9.11% 


15.92% 


17.28% 


18.53% 


20.42% 


20.84% 


21.15% 


21.78% 


22.20% 


21.88% 


10 4 


8.88% 


13.55% 


17.17% 


18.73% 


19.95% 


20.83% 


21.38% 


21.71% 


21.97% 


22.01% 


10 5 


8.04% 


13.73% 


16.84% 


18.78% 


20.04% 


20.80% 


21.35% 


21.73% 


21.98% 


22.10% 


10 6 


8.29% 


13.60% 


16.80% 


18.74% 


20.00% 


20.80% 


21.35% 


21.73% 


21.98% 


22.15% 



Table 1. Optimal k for various sample lengths n from 10 3 to 10 



The figures suggest that the Hill estimator does not overestimate the tail exponent 
on the 1% tail so strongly. Thus, we repeat the exercise for k < 1%. 

3.3.2. Hill estimation for k < 1%. Figures 3, 4, 5, and 6 show the behavior of the 
Hill estimator on generated datasets with length 10 3 up to 10 6 . The results suggest 
that it makes no sense to use the Hill estimator for tail exponent estimation even on 
the 10 5 dataset. For example, a of 1.2,1.3,1.4,1.5, and 1.6 cannot be statistically 
distinguished from each other. Only the dataset with length 10 6 shows better results. 
Using the simulations on such a long dataset, we can choose the optimal range of 
k. More exactly, the optimal k interval is such where the estimated value of the 
tail exponent does not deviate from its theoretical value by more than 5%. Table 2 
shows the optimal sets of k where the Hill estimator was a good estimate of the tail 
exponent. 



a Range 



1.2 


0.08% 


-1% 


1.3 


0.1% - 


0.35% 


1.4 


0.06% - 


0.22% 


1.5 


0.15% - 


0.25% 


1.6 


0.01% - 


0.12% 


1.7 


0.03% - 


0.12% 


1.8 


0.02% - 


0.08% 


1.9 


0.02% - 


0.06% 


2 


0.005% - 


- 0.03% 



Table 2. Optimal range of k for series length 10 6 for k < 1% 



Optimal value of k is such k for which the difference of the estimated and simulated a is minimal. 
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Figure 1. Hill estimate of the tail exponent on the l%-20% tail, 10 3 length 
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Figure 2. Hill estimate of the tail exponent on the l%-20% tail, 10 6 length 
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Figure 3. Hill estimate of the tail exponent on the < 1% tail, 10 3 length 
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Figure 4. Hill estimate of the tail exponent on the < 1% tail, 10 4 length 
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Figure 5. Hill estimate of the tail exponent on the < 1% tail, 10 5 length 
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Figure 6. Hill estimate of the tail exponent on the < 1% tail, 10 6 length 
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In order to estimate the true tail exponent, we need a dataset with length of at 
least 10 6 , otherwise inference of the tail exponent may be strongly misleading and 
rejection of the Levy stable regime not appropriate. For the sake of clarity we have 
to note that the results for the a = 2 should be interpreted with caution as it is the 
special case of Gaussian distribution which does not have heavy tails. Thus the Hill 
estimator is not appropriate in this case. 

In the next section, we develop a Monte Carlo-based tail exponent estimator which 
deals with all of these problems. 

4. Tail exponent estimator based on Monte Carlo simulations 

After the study of the finite sample properties of the Hill estimator, we utilize the 
results and introduce our Monte Carlo-based estimator. The Monte Carlo technique 
provides an attractive method of building exact tests from statistics whose finite 
sample distribution is intractable but can be simulated, thus it can be utilized in our 
problem. 

We construct our Monte Carlo-based tail exponent estimator as follows. 

(1) Generate 1,000 i.i.d. a— stable distributed random variables X a ° of length 
n, i.e. ^2°, • • • , x"° ~ S(a , 0, v2/2, 0), for each a parameter from the 
range [1.01, 2] with step 0.01. 

(2) For each X a °, estimate the tail exponent using the Hill estimator for all k 
from the interval (1% — 20%), i.e. a ao ^. 

(3) From Monte Carlo simulations compute the expected value E[a ao ^} of the 
Hill estimator for all k and all a^. 

(4) Using the Hill estimator estimate the tail exponent a em p,k on an empirical 
dataset of length n for all k from the interval (1% — 20%). 

(5) Our Monte Carlo-based estimator &mc is defined as: 



In other words, we simulate random variables X of length n for 100 different a's 
(representing the tail parameter) 1,000 times. 5 On these random variables we esti- 
mate the tail exponent using the Hill estimator on the (1% — 20%) tail (as we have 



(11) 




a 6[1.01,2] 



5 A more exact confidence interval can be obtained by generating more random variables 
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Figure 7. Plots of 99%, 95%, and 90% quantiles of our Monte Carlo- 
based tail exponent estimator based on 100 simulations. 

shown in the previous section, the Hill estimator does not really work on k < 1% for 
series of lengths < 10 6 ). Using the simulated variables we get the expected value of 
the Hill estimator for all k and all o.q. 

After we have obtained the behavior of the Hill estimator for a large grid of different 
tail exponents and k, we can estimate the tail exponent on the empirical dataset. 
Thus, we estimate the tail exponent of empirical data using the Hill estimator for 
all values of k. As the last step, we minimize the loss function which is defined by 
Equation 11 through the whole simulated grid of ao- Our Monte Carlo-based tail 
exponent estimate &mc is the value for which the distance between E[a ao ,k] and 
«em P ,fc is minimal (Equation 11). 

4.1. Properties of &mc- I n this section we study the finite sample properties of 
our estimation method. For this purpose, we will again use the Monte Carlo pro- 
cedure, which will provide us the statistical inference for our estimator. We simu- 
late independent identically a— stable distributed random variables of length n from 
S(a, 0, V2/2, 0) with varying a G [1.01, 2] with step 0.01. For each simulated sample, 
we apply our method and estimate the tail exponent. This Monte Carlo simulation 
allows us to derive the finite sample properties of our estimator, so we can get the 
confidence interval of the estimates. 

Figure 7 plots the 99%, 95%, and 90% quantiles of our tail exponent estimator based 
on 100 simulations. Table 3 provides exact simulated confidence intervals for several 
values of a. 
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1.49 


a = 


1.4 


1.23 


1.26 


1.27 


1.39 


1.54 


1.56 


1.58 


a = 


1.5 


1.34 


1.35 


1.37 


1.49 


1.61 


1.63 


1.66 


a = 


1.6 


1.38 


1.48 


1.50 


1.60 


1.77 


1.80 


1.88 


a = 


1.7 


1.55 


1.56 


1.58 


1.69 


1.79 


1.83 


1.87 


a = 


1.8 


1.62 


1.62 


1.66 


1.79 


1.88 


1.89 


1.96 


a = 


1.9 


1.77 


1.77 


1.78 


1.89 


1.98 


2 


2 


a = 


2 


1.94 


1.95 


1.95 


2 


1.99 


2 


2 



Table 3. Quantiles of our estimation method. 



In comparison with the Hill estimator, or any other tail exponent estimator, our 
method has several advantages. The largest advantage is that the method is not 
sensitive to the choice of k, unlike most of the other estimation methods discussed 
in the previous text. Our method also works well for smaller samples, i.e. 10 3 , as it 
yields much narrower confidence intervals compared to the Hill estimator. 

5. Empirical study 

To illustrate the power of our tail exponent estimation method, we employ empirical 
data. We use the daily returns of the following stock market indices: the German 
DAX 30, the U.S. S&P 500 and Dow Jones Industrial 30 (DJI), the London FTSE 
100, the Nikkei 225, and the Singapore Straits Times from the beginning of 2002 to 
the end of 2009. More precisely, we choose the last 2,000 observations for each index 
and divide them into two equal periods, 2002-2005 and 2006-2009, each containing 
1,000 observations. This way, we can compare different world stock market indices 
and their behavior before and during the current financial crisis . 

Figures 9 and 10 show the histograms of the tested indices. Figure 9 shows the 
histograms for the first period 2002-2005, while Figure 10 shows the histograms for 
the second period 2006-2009. All the data are leptokurtic, showing excessive peaks 
around the mean and thicker tails than those of the normal density. Moreover, the 
data from the second period tend to show even more excessive kurtosis and heavier 
tails than the ones from the first period. This is probably caused by the large 
price movements that occurred during the deep financial turmoil in the years 2007- 
2009. By contrast, the first period of stable growth is closer to the standard normal 
distribution. 



MONTE CARLO-BASED TAIL EXPONENT ESTIMATOR. 



16 



We use our Monte Carlo method to estimate the tail exponents for all the datasets. 
Before we start the estimations, we set the length n of the simulated series from our 
algorithm to 1,000 so that it corresponds to the length of the empirical dataset and 
we can statistically compare the estimates. Table 4 shows the tail exponent estimates 
with confidence intervals. Let us further demonstrate the estimation method on the 











Empirical Confidence Intervals 










0.5% 


2.5% 


5% 


OLMC 


95% 


97.5% 


99.5% 




zUUz- 
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ZUUO 
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l.OU 

1.47 


1.53 
1.53 


1.55 
1.56 


1.64 
1.70 


1.77 
1.79 


1.8 
1.81 


l.OO 

1.87 


U.S. S&P500 


2002- 
2006- 


-2005 
-2009 


1.53 
1.35 


1.57 
1.38 


1.58 
1.4 


1.74 
1.54 


1.85 
1.63 


1.88 
1.70 


1.94 
1.72 


U.S. DJI 30 


2002- 
2006- 


-2005 
-2009 


1.50 
1.29 


1.53 
1.38 


1.55 
1.4 


1.69 
1.54 


1.83 
1.66 


1.87 
1.70 


1.89 
1.77 


London FTSE 100 


2002- 
2006- 


-2005 
-2009 


1.34 
1.50 


1.42 
1.55 


1.43 
1.55 


1.57 
1.69 


1.69 
1.82 


1.69 
1.87 


1.71 
1.91 


Japan Nikkei 225 


2002- 
2006- 


-2005 
-2009 


1.79 
1.58 


1.81 
1.60 


1.84 
1.62 


1.93 
1.77 


1.98 
1.88 


2 

1.89 


2 

1.89 


Singapore Straits 


2002- 
2006- 


-2005 
-2009 


1.72 
1.47 


1.72 
1.50 


1.74 
1.52 


1.85 
1.66 


1.94 
1.78 


1.95 
1.81 


1.97 
1.87 



Table 4. Exact confidence intervals (quantiles) of the estimated tail 



exponent a. 
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Figure 8. Plot of Hill estimates through different k for the FTSE 
index (second period of 2006-2009) for a = 1.55, a = 1.69, and a = 
1.87, which are our olmc with the 0.025% and 97.5% quantiles. 

example of the London FTSE 100 index for the second period. Figure 8 plots the 
empirical Hill estimates through different k and compares them to 1,000 simulations 
of a = 1.55, a = 1.69, and a = 1.87 (these are our Monte Carlo estimates with 
the 0.025% and 97.5% quantiles). It is immediately visible from the plots that the 
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loss function defined in our estimation algorithm reaches its minimum for a = 1.69. 
We plot another two cases just for illustration of what our estimate looks like at the 
critical levels. After minimizing the loss function and choosing the tail exponent, we 
derive the finite sample properties of the estimate. From Table 4, we can see that 
1.55 and 1.87 are the 0.025% and 97.5% quantiles. 

Unlike the classical Hill estimation procedure, we can see that our method is not 
sensitive to the choice of k. Moreover, it is a much stronger result for the tail 
exponent, as it accounts for all possible fc's, not only the one chosen. Using the 
classical Hill estimation procedure, we would select k and compute the tail exponent, 
but as we discussed in the previous text, there is always a trade-off between bias and 
variance when choosing the optimal k. Our method reduces this problem. Finally, 
the sample on which we estimate the tail exponent is quite small. In previous sections 
we showed that Hill estimation does not work well for such small samples, but our 
method works fine. 

6. Conclusion 

In this paper, we have researched the finite sample behavior of the Hill estimator 
under a— stable distributions and we introduce a Monte Carlo-based method of es- 
timation for the tail exponent. 

First of all, we provide Monte Carlo confidence intervals for the Hill estimator under 
different a's and k on the interval k e (0, 20%] of sample size 10 3 up to 10 6 . We also 
provide the optimal range of k for the series length 10 6 . Based on our simulations, we 
conclude that in order to estimate the true tail exponent using the Hill estimator, we 
need a dataset of at least 10 6 length, i.e., high frequency data, otherwise inference of 
the tail exponent may be strongly misleading and rejection of the Levy stable regime 
not appropriate. 

In the second part of the paper, we utilize the results and introduce a tail exponent 
procedure based on Monte Carlo simulations. It is based on the idea of simulating 
a large grid of random variables from the Levy stable distribution with different a 
exponents and estimating the tail exponent using the Hill estimator for all different 
k. After obtaining the behavior of the Hill estimator on this large grid of different tail 
exponents and k, all we need to do is to estimate the tail exponent on the empirical 
dataset for all fc's and compare all the values with the simulated grid. Using this 
algorithm we get estimates of the tail exponent. 

In comparison with the Hill estimator, or any other tail exponent estimator, our 
method has several advantages. The largest advantage is that it is not sensitive 
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to the choice of k. Moreover, the method works well for small samples. The new 
estimator gives unbiased results with symmetrical confidence intervals. 

Finally, we illustrate the power of our tail exponent estimation method on an empir- 
ical dataset. We use daily returns of leading world stock markets: the German DAX, 
the U.S. S&P 500 and Dow Jones Industrial (DJI), the London FTSE, the Nikkei 
225, and the Singapore Straits Times from the beginning of 2002 to the end of 2009. 
We divide this period into two equal sub-periods and compare the tail exponents 
estimated using our method. 
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Figure 9. Histograms of all indices for the first period compared with 
the standard normal distribution 
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Figure 10. Histograms of all indices for the second period compared 
with the standard normal distribution 



